Interactive data Analysis: The Control Project

نویسندگان

  • Joseph M. Hellerstein
  • Ron Avnur
  • Andy Chou
  • Christian Hidber
  • Christopher Olston
  • Vijayshankar Raman
  • Tali Roth
  • Peter J. Haas
چکیده

51 Interactive Data Analysis: The Control Project D ata analysis is fundamentally an iterative process in which you issue a query, receive a response, formulate the next query based on the response, and repeat. You usually don't issue a single, perfectly chosen query and get the information you want from a database; indeed, the purpose of data analysis is to extract unknown information, and in most situations there is no one perfect query. 1 People naturally start by asking broad, big-picture questions and then continually refine their questions based on feedback and domain knowledge. 2 Consider repeating this process several times over, sifting through many more results, and you have an idea of why using advanced data analysis tools is so complex. Composing Structured Query Language (SQL) queries for decision-support database management systems (DBMSs) isn't easy, and even users of graphical query tools find it difficult to generate insightful queries. Although data-mining systems typically don't provide complicated query languages, to use these systems you need to choose a suitable mining algorithm and carefully tune various algorithm-specific parameters such as support and confidence for association rule mining, thresholds for clustering, training sets for classification , and so on. These usability problems increase the number of iterations in the analysis process; you have to try algorithms with different parameters until you find one that produces useful results. In addition, many of these tools require complicated, time-consuming setup phases before they can be used at all. Most research in the areas of decision support, data visualization, statistics, data mining and knowledge discovery has concentrated on improving a single iteration of the analysis process. Some work has focused on improving the quality of a particular analysis result or on reducing the time it takes for each analysis step or algorithm to provide a complete response. These fields have progressed greatly, but this research focus ignores a basic invariant in computing: Full-scale data analysis will always be slow. As Greg Papadopoulos, chief technology officer at Sun, points out, the appetite for data collection, storage, and analysis is outstripping Moore's law, meaning that the time required to analyze massive data sets is steadily growing. To date, the result is a worst-case mode of human-computer interaction: Data analysis is a complex process involving multiple, time-consuming steps, and a poor or erroneous choice of inputs is not noticeable until results return at the end of a given step. …

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Relation Between Imprecise DESA and MOLP Methods

It is generally accepted that Data Envelopment Analysis (DEA) is a method for indicating efficiency. The DEA method has many applications in the field of calculating the relative efficiency of Decision Making Units (DMU) in explicit input-output environments. Regarding imprecise data, several definitions of efficiency can be found. The aim of our work is showing an equivalence relation between ...

متن کامل

EFL Teachers’ Identity Construction through a Reflection Consciousness-Raising Interactive Workshop

As part of a large-scale project, the current qualitative study investigated the possible contribution of a consciousness-raising interactive workshop (as a form of professional development activity) to 30) 22 female and 8 male) Iranian EFL teachers’ professional identity construction. Thirty Iranian EFL teachers were asked to write two reflective journals (one individually and one collectively...

متن کامل

The Impact of Management Control Systems on Contemporary Management Accounting Practices in the Public Sector

The purpose of this paper is to investigate the effect of the interactive and diagnostic use of management control systems on the adoption and success of contemporary management accounting practices in the public sector. Contemporary management accounting practices includes: benchmarking, activity-based costing, the balanced scorecard, value chain analysis, total quality management, key perform...

متن کامل

Design of modern interactive and ergonomic home air purifier

Introduction: The subject of this research is having healthy air and its challenge is air purification to have this type of air. Healthy air is free of any pollutants, including odors, harmful gasses, dust, and viruses, especially corona. This healthy air is provided by a purifier device. One of the problems of metropolises is the lack of healthy air, which is one of the most important human ne...

متن کامل

A fuzzy multi-objective model for a project management problem

In this research, the multi-objective project management decision problem with fuzzy goals and fuzzy constraints are considered. We constitute α-cut approach and two various fuzzy goal programming solution methods for solving the Multi-Objective Project Management (MOPM) decision problem under fuzzy environments. The Interactive fuzzy multi-objective linear programming (i-FMOLP) and Weighted Ad...

متن کامل

The Effect of Interactive Management Style on Academic Adjustment, Math Anxiety and Academic Engagement of Students

Purpose: The purpose of this study was to examine the effect of interactional management style on academic adjustment, mathematical anxiety, and academic engagement in elementary sixth grade students. Methodology:  This research is applied in terms of purpose and quasi-experimental in terms of method and with pre-test-post-test design with experimental group and control group. The statistical ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IEEE Computer

دوره 32  شماره 

صفحات  -

تاریخ انتشار 1999